On Sharp Identification Regions for Regression Under Interval Data

Authors

  • Georg Schollmeyer
  • Thomas Augustin
Abstract

The reliable analysis of interval data (coarsened data) is one of the most promising applications of imprecise probabilities in statistics. If one refrains from making untestable, and often materially unjustified, strong assumptions about the coarsening process, then the empirical distribution of the data is imprecise, and statistical models are, in Manski’s terms, partially identified. We first elaborate some subtle differences between two natural ways of handling interval data in the dependent variable of regression models, distinguishing between two types of identification regions, called here the Sharp Marrow Region (SMR) and the Sharp Collection Region (SCR). Focusing on linear regression analysis, we then derive some fundamental geometrical properties of SMR and SCR, allowing a comparison of the regions and providing some guidelines for their canonical construction. Relying on the algebraic framework of adjunctions of two mappings between partially ordered sets, we characterize SMR as a right adjoint and as the monotone kernel of a criterion-function-based mapping, while SCR is interpretable as the corresponding monotone hull. Finally, we sketch some ideas on a compromise between SMR and SCR based on a set-domained loss function. This paper is an extended version of a shorter paper with the same title, which is conditionally accepted for publication in the Proceedings of the Eighth International Symposium on Imprecise Probability: Theories and Applications. In the present paper we have added proofs and a seventh chapter with a small Monte Carlo illustration, which would have made the original paper too long.
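As a rough illustration of the "collection" idea in the linear case, the sketch below assumes ordinary least squares as the criterion and intervals only in the dependent variable. Under that assumption the estimate is linear in y, so the set of estimates obtained from all data sets compatible with the intervals is the image of a box under a linear map, and its component-wise bounds can be read off directly. The function name `ols_collection_bounds` and the toy data are hypothetical, and the code returns only the bounding box of that set. It is not the construction given in the paper.

```python
import numpy as np

def ols_collection_bounds(X, y_lo, y_hi):
    """Component-wise bounds of the OLS estimates over all y with y_lo <= y <= y_hi.

    Assumption (not taken from the paper): OLS criterion, X has full column rank.
    """
    X = np.asarray(X, dtype=float)
    y_lo = np.asarray(y_lo, dtype=float)
    y_hi = np.asarray(y_hi, dtype=float)
    # OLS is linear in y: beta = A @ y, with A the pseudoinverse of X.
    A = np.linalg.pinv(X)
    A_pos = np.clip(A, 0, None)   # positive entries of A
    A_neg = np.clip(A, None, 0)   # negative entries of A
    # Each coefficient is minimized by pairing positive weights with lower
    # endpoints and negative weights with upper endpoints (and vice versa).
    lower = A_pos @ y_lo + A_neg @ y_hi
    upper = A_pos @ y_hi + A_neg @ y_lo
    return lower, upper

# Toy data: intercept plus one covariate, three interval-valued responses.
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]])
y_lo = np.array([0.0, 1.0, 2.0])
y_hi = np.array([1.0, 2.0, 3.0])
lower, upper = ols_collection_bounds(X, y_lo, y_hi)
print("intercept bounds:", lower[0], upper[0])
print("slope bounds:", lower[1], upper[1])
```

When every interval degenerates to a point (`y_lo == y_hi`), both bounds coincide with the ordinary OLS estimate.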


Similar resources

Sharp Identification Regions in Models with Convex Moment Predictions by Arie Beresteanu,

We provide a tractable characterization of the sharp identification region of the parameter vector θ in a broad class of incomplete econometric models. Models in this class have set-valued predictions that yield a convex set of conditional or unconditional moments for the observable model variables. In short, we call these models with convex moment predictions. Examples include static, simultan...


Multiple Fuzzy Regression Model for Fuzzy Input-Output Data

A novel approach to the problem of regression modeling for fuzzy input-output data is introduced. In order to estimate the parameters of the model, a distance on the space of interval-valued quantities is employed. By minimizing the sum of squared errors, a class of regression models is derived based on the interval-valued data obtained from the $\alpha$-level sets of fuzzy input-output data. Then,...


A Consistent Estimator for Uniform Parameter Under Interval Censoring

Censored data are widely used in statistical tests and parameter estimation. In some cases, e.g. medical accidents whose data are not recorded at the time of occurrence, methods such as interval censoring are used. In this paper, interval censoring is used for a random sample uniformly distributed on the interval (0,θ). A consistent estimator of θ and some asy...


Bounds on Causal Effects in Three-Arm Trials with Non-compliance

This paper considers the analysis of three-arm randomized trials with noncompliance. In these trials, the average causal effects of treatments within principal strata of compliance behavior are of interest for better understanding the effect of the treatment. Unfortunately, even with usual assumptions, the average causal effects of treatments within principal strata are not point-identified. Ho...


Computation of Bounds on Population Parameters When the Data Are Incomplete

This paper continues our research on the identification and estimation of statistical functionals when the sampling process produces incomplete data due to missing observations or interval measurement of variables. Incomplete data usually cause population parameters of interest in applications to be unidentified except under untestable and often controversial assumptions. However, it is often p...




Publication date: 2013